Indiana County
Structure and Destructure: Dual Forces in the Making of Knowledge Engines
The making of knowledge engines in natural language processing has been shaped by two seemingly distinct paradigms: one grounded in structure, the other driven by massively available unstructured data. The structured paradigm leverages predefined symbolic interactions, such as knowledge graphs, as priors and designs models to capture them. In contrast, the unstructured paradigm centers on scaling transformer architectures with increasingly vast data and model sizes, as seen in modern large language models. Despite their divergence, this thesis seeks to establish conceptual connections bridging these paradigms. Two complementary forces, structure and destructure, emerge across both paradigms: structure organizes seen symbolic interactions, while destructure, through periodic embedding resets, improves model plasticity and generalization to unseen scenarios. These connections form a new recipe for developing general knowledge engines that can support transparent, controllable, and adaptable intelligent systems.
- North America > United States > New York > New York County > New York City (0.14)
- Europe > Bulgaria (0.04)
- North America > United States > California > Los Angeles County > Los Angeles (0.04)
- (25 more...)
- Research Report > New Finding (1.00)
- Personal > Honors (1.00)
- Overview (1.00)
- Media > Film (1.00)
- Leisure & Entertainment > Sports (1.00)
- Health & Medicine > Therapeutic Area > Neurology (1.00)
- (4 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- (8 more...)
Manifold Learning for Hyperspectral Images
Harkat, Fethi, Deuberet, Tiphaine, Gey, Guillaume, Perrier, Valérie, Polisano, Kévin
Abstract--Traditional feature extraction and projection techniques, such as Principal Component Analysis, struggle to adequately represent X-Ray Transmission (XRT) Multi-Energy (ME) images, limiting the performance of neural networks in decision-making processes. To address this issue, we propose a method that approximates the dataset topology by constructing adjacency graphs using the Uniform Manifold Approximation and Projection. This technique not only preserves the global structure of the data but also enhances feature separability, leading to more accurate and robust classification results. Figure 1: Scheme of the experimental setting (left) view from aside. A top view of the detector is also given (right) to make I. Recent advances in Hyperspectral Images (HSI) analysis have primarily focused on reflection spectroscopy in the visible or near-infrared light domains.
- Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.05)
- North America > United States > Pennsylvania > Indiana County (0.04)
- North America > United States > Indiana > Tippecanoe County (0.04)
- (2 more...)
A Racing Dataset and Baseline Model for Track Detection in Autonomous Racing
Ghosh, Shreya, Chen, Yi-Huan, Huang, Ching-Hsiang, Jameel, Abu Shafin Mohammad Mahdee, Ho, Chien Chou, Gamal, Aly El, Labi, Samuel
--A significant challenge in racing-related research is the lack of publicly available datasets containing raw images with corresponding annotations for the downstream task. In this paper, we introduce RoRaTrack, a novel dataset that contains annotated multi-camera image data from racing scenarios for track detection. The data is collected on a Dallara A V-21 at a racing circuit in Indiana, in collaboration with the Indy Autonomous Challenge (IAC). RoRaTrack addresses common problems such as blurriness due to high speed, color inversion from the camera, and absence of lane markings on the track. Consequently, we propose RaceGAN, a baseline model based on a Generative Adversarial Network (GAN) that effectively addresses these challenges. The proposed model demonstrates superior performance compared to current state-of-the-art machine learning models in track detection. The dataset and code for this work are available at github.com/RaceGAN. Modern vehicles are increasingly equipped with a range of computer vision technologies to assist drivers and improve road safety. A critical application of these technologies, particularly for autonomous and self-driving vehicles, is lane detection, which ensures that vehicles remain within designated lanes [1]. Lane detection systems not only help maintain proper lane alignment, but also provide visual cues to drivers about lane boundaries. Similarly, autonomous technologies are being integrated into race cars, giving rise to the emerging field of autonomous racing. In this domain, vehicles operate entirely without human intervention, relying solely on artificial intelligence and computer vision algorithms [2].
- North America > United States > Pennsylvania > Indiana County (0.04)
- North America > United States > Indiana > Putnam County (0.04)
Unsupervised Machine Learning for Detecting and Locating Human-Made Objects in 3D Point Cloud
Zhao, Hong, Huang, Huyunting, Zhang, Tonglin, Yang, Baijian, Wei-Kocsis, Jin, Fei, Songlin
A 3D point cloud is an unstructured, sparse, and irregular dataset, typically collected by airborne LiDAR systems over a geological region. Laser pulses emitted from these systems reflect off objects both on and above the ground, resulting in a dataset containing the longitude, latitude, and elevation of each point, as well as information about the corresponding laser pulse strengths. A widely studied research problem, addressed in many previous works, is ground filtering, which involves partitioning the points into ground and non-ground subsets. This research introduces a novel task: detecting and identifying human-made objects amidst natural tree structures. This task is performed on the subset of non-ground points derived from the ground filtering stage. Marked Point Fields (MPFs) are used as models well-suited to these tasks. The proposed methodology consists of three stages: ground filtering, local information extraction (LIE), and clustering. In the ground filtering stage, a statistical method called One-Sided Regression (OSR) is introduced, addressing the limitations of prior ground filtering methods on uneven terrains. In the LIE stage, unsupervised learning methods are lacking. To mitigate this, a kernel-based method for the Hessian matrix of the MPF is developed. In the clustering stage, the Gaussian Mixture Model (GMM) is applied to the results of the LIE stage to partition the non-ground points into trees and human-made objects. The underlying assumption is that LiDAR points from trees exhibit a three-dimensional distribution, while those from human-made objects follow a two-dimensional distribution. The Hessian matrix of the MPF effectively captures this distinction. Experimental results demonstrate that the proposed ground filtering method outperforms previous techniques, and the LIE method successfully distinguishes between points representing trees and human-made objects.
- North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
- North America > United States > Pennsylvania > Indiana County (0.04)
- North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
- (2 more...)
Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives
Grauman, Kristen, Westbury, Andrew, Torresani, Lorenzo, Kitani, Kris, Malik, Jitendra, Afouras, Triantafyllos, Ashutosh, Kumar, Baiyya, Vijay, Bansal, Siddhant, Boote, Bikram, Byrne, Eugene, Chavis, Zach, Chen, Joya, Cheng, Feng, Chu, Fu-Jen, Crane, Sean, Dasgupta, Avijit, Dong, Jing, Escobar, Maria, Forigua, Cristhian, Gebreselasie, Abrham, Haresh, Sanjay, Huang, Jing, Islam, Md Mohaiminul, Jain, Suyog, Khirodkar, Rawal, Kukreja, Devansh, Liang, Kevin J, Liu, Jia-Wei, Majumder, Sagnik, Mao, Yongsen, Martin, Miguel, Mavroudi, Effrosyni, Nagarajan, Tushar, Ragusa, Francesco, Ramakrishnan, Santhosh Kumar, Seminara, Luigi, Somayazulu, Arjun, Song, Yale, Su, Shan, Xue, Zihui, Zhang, Edward, Zhang, Jinxu, Castillo, Angela, Chen, Changan, Fu, Xinzhu, Furuta, Ryosuke, Gonzalez, Cristina, Gupta, Prince, Hu, Jiabo, Huang, Yifei, Huang, Yiming, Khoo, Weslie, Kumar, Anush, Kuo, Robert, Lakhavani, Sach, Liu, Miao, Luo, Mi, Luo, Zhengyi, Meredith, Brighid, Miller, Austin, Oguntola, Oluwatumininu, Pan, Xiaqing, Peng, Penny, Pramanick, Shraman, Ramazanova, Merey, Ryan, Fiona, Shan, Wei, Somasundaram, Kiran, Song, Chenan, Southerland, Audrey, Tateno, Masatoshi, Wang, Huiyu, Wang, Yuchen, Yagi, Takuma, Yan, Mingfei, Yang, Xitong, Yu, Zecheng, Zha, Shengxin Cindy, Zhao, Chen, Zhao, Ziwei, Zhu, Zhifan, Zhuo, Jeff, Arbelaez, Pablo, Bertasius, Gedas, Crandall, David, Damen, Dima, Engel, Jakob, Farinella, Giovanni Maria, Furnari, Antonino, Ghanem, Bernard, Hoffman, Judy, Jawahar, C. V., Newcombe, Richard, Park, Hyun Soo, Rehg, James M., Sato, Yoichi, Savva, Manolis, Shi, Jianbo, Shou, Mike Zheng, Wray, Michael
We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike repair). More than 800 participants from 13 cities worldwide performed these activities in 131 different natural scene contexts, yielding long-form captures from 1 to 42 minutes each and 1,422 hours of video combined. The multimodal nature of the dataset is unprecedented: the video is accompanied by multichannel audio, eye gaze, 3D point clouds, camera poses, IMU, and multiple paired language descriptions -- including a novel "expert commentary" done by coaches and teachers and tailored to the skilled-activity domain. To push the frontier of first-person video understanding of skilled human activity, we also present a suite of benchmark tasks and their annotations, including fine-grained activity understanding, proficiency estimation, cross-view translation, and 3D hand/body pose. All resources will be open sourced to fuel new research in the community.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > United States > Pennsylvania > Philadelphia County > Philadelphia (0.13)
- (23 more...)
- Research Report > New Finding (1.00)
- Instructional Material (1.00)
- Leisure & Entertainment > Sports > Soccer (1.00)
- Education > Educational Setting (0.92)
- Leisure & Entertainment > Sports > Basketball (0.92)
- (4 more...)
Compositional Generalization for Data-to-Text Generation
Xu, Xinnuo, Titov, Ivan, Lapata, Mirella
Data-to-text generation involves transforming structured data, often represented as predicate-argument tuples, into coherent textual descriptions. Despite recent advances, systems still struggle when confronted with unseen combinations of predicates, producing unfaithful descriptions (e.g. hallucinations or omissions). We refer to this issue as compositional generalisation, and it encouraged us to create a benchmark for assessing the performance of different approaches on this specific problem. Furthermore, we propose a novel model that addresses compositional generalization by clustering predicates into groups. Our model generates text in a sentence-by-sentence manner, relying on one cluster of predicates at a time. This approach significantly outperforms T5~baselines across all evaluation metrics.Notably, it achieved a 31% improvement over T5 in terms of a metric focused on maintaining faithfulness to the input.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Asia > Middle East > Republic of Türkiye > İzmir Province > İzmir (0.05)
- (23 more...)
Indiana man allegedly kills, dismembers father after believing him to be robot: 'Had to shoot at it'
Fox News contributor Leo Terrell joined'America's Newsroom' to discuss why crime is surging nationwide and how'parental involvement' can reverse the dangerous trend. An Indiana man was slapped with 10 charges after he allegedly fatally shot his father and dismembered his corpse after believing him to be a robot. Shawn Hays, 53, of Lawrence County, Indiana, was arrested Dec. 20 after deputies responded to a welfare check call on his 73-year-old father Rodney Hays, according to a probable cause affidavit cited by local Fox affiliate WXIN. The person who called the police informed them that Hays told them that he had shot and mutilated his father because he had been turned into a robot. Shawn Hays, 53, was slapped with 10 charges after he allegedly fatally shot his father and dismembered his corpse after believing him to be a robot.
- North America > United States > Pennsylvania > Indiana County (0.26)
- North America > United States > Indiana > Lawrence County (0.26)
- North America > United States > South Carolina (0.06)
- North America > United States > North Carolina (0.06)
- Law > Criminal Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Leveraging Wastewater Monitoring for COVID-19 Forecasting in the US: a Deep Learning study
Fazli, Mehrdad, Shakeri, Heman
The outburst of COVID-19 in late 2019 was the start of a health crisis that shook the world and took millions of lives in the ensuing years. Many governments and health officials failed to arrest the rapid circulation of infection in their communities. The long incubation period and the large proportion of asymptomatic cases made COVID-19 particularly elusive to track. However, wastewater monitoring soon became a promising data source in addition to conventional indicators such as confirmed daily cases, hospitalizations, and deaths. Despite the consensus on the effectiveness of wastewater viral load data, there is a lack of methodological approaches that leverage viral load to improve COVID-19 forecasting. This paper proposes using deep learning to automatically discover the relationship between daily confirmed cases and viral load data. We trained one Deep Temporal Convolutional Networks (DeepTCN) and one Temporal Fusion Transformer (TFT) model to build a global forecasting model. We supplement the daily confirmed cases with viral loads and other socio-economic factors as covariates to the models. Our results suggest that TFT outperforms DeepTCN and learns a better association between viral load and daily cases. We demonstrated that equipping the models with the viral load improves their forecasting performance significantly. Moreover, viral load is shown to be the second most predictive input, following the containment and health index. Our results reveal the feasibility of training a location-agnostic deep-learning model to capture the dynamics of infection diffusion when wastewater viral load data is provided.
- North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
- Oceania > Australia (0.04)
- (19 more...)
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
Suzgun, Mirac, Melas-Kyriazi, Luke, Jurafsky, Dan
In open-ended natural-language generation, existing text decoding methods typically struggle to produce text which is both diverse and high-quality. Greedy and beam search are known to suffer from text degeneration and linguistic diversity issues, while temperature, top-k, and nucleus sampling often yield diverse but low-quality outputs. In this work, we present crowd sampling, a family of decoding methods based on Bayesian risk minimization, to address this diversity-quality trade-off. Inspired by the principle of "the wisdom of the crowd," crowd sampling seeks to select a candidate from a pool of candidates that has the least expected risk (i.e., highest expected reward) under a generative model according to a given utility function. Crowd sampling can be seen as a generalization of numerous existing methods, including majority voting, and in practice, it can be used as a drop-in replacement for existing sampling methods. Extensive experiments show that crowd sampling delivers improvements of 3-7 ROUGE and BLEU points across a wide range of tasks, including summarization, data-to-text, translation, and textual style transfer, while achieving new state-of-the-art results on WebNLG and WMT'16.
- North America > United States > Wisconsin > Outagamie County > Appleton (0.14)
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- Asia > Nepal > Bagmati Province > Kathmandu District > Kathmandu (0.04)
- (35 more...)
- Research Report > New Finding (0.68)
- Personal > Obituary (0.46)
Soundscape Ecology: The Science of Sound in the Landscape
Note that the Buckeye Flats location (a) contains greater acoustic activity, a result of the nearby rapid flowing stream that produced considerable geophonic sounds. The inset (b) graphs the same data but with Buckeye Flats removed. These values (b) reflect mostly biophony. Sycamore Creek contained the greatest acoustic activity of these three. The fall contains the greatest activity although there was no consistent pattern across sites. Photos of each landscape are provided in (c).
- Europe > Italy > Tuscany (0.05)
- North America > Costa Rica (0.04)
- North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
- (12 more...)
- Government (0.68)
- Energy (0.46)
- Health & Medicine (0.46)